The Enterprise Big Data Lake by Alex Gorelik

The Enterprise Big Data Lake by Alex Gorelik

Author:Alex Gorelik [Alex Gorelik]
Language: eng
Format: epub
Publisher: O'Reilly Media, Inc.
Published: 2019-03-11T04:00:00+00:00


Establishing Trust

Once an analyst finds the pertinent data set, the next question becomes whether the data can be trusted. While analysts sometimes have the luxury of access to clean, trusted, curated data sets, more often than not they have to independently ascertain whether they can trust the data. Trust is usually based on three pillars:

Data quality—how complete and clean the data set is

Lineage (aka provenance)—where the data came from

Stewardship—who created the data set, and why



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.